The heat equation is an important partial differential equation which describes the distribution of heat (or variation in temperature) in a given region over time. For a function u(x,y,z,t) of three spatial variables (x,y,z) and the time variable t, the heat equation is
also written
or sometimes
where is a constant and or denotes the Laplacian operator. For the mathematical treatment it is sufficient to consider the case α=1.
The heat equation is of fundamental importance in diverse scientific fields. In mathematics, it is the prototypical parabolic partial differential equation. In probability theory, the heat equation is connected with the study of Brownian motion via the Fokker–Planck equation. In financial mathematics it is used to solve the Black–Scholes partial differential equation. The diffusion equation, a more general version of the heat equation, arises in connection with the study of chemical diffusion and other related processes.
Contents |
Suppose one has a function u which describes the temperature at a given location (x, y, z). This function will change over time as heat spreads throughout space. The heat equation is used to determine the change in the function u over time. The image to the right is animated and describes the way heat changes in time along a metal bar. One of the interesting properties of the heat equation is the maximum principle which says that the maximum value of u is either earlier in time than the region of concern or on the edge of the region of concern. This is essentially saying that temperature comes either from some source or from earlier in time because heat permeates but is not created from nothingness. This is a property of parabolic partial differential equations and is not difficult to prove mathematically (see below).
Another interesting property is that even if u has a discontinuity at an initial time t = t0, the temperature becomes smooth as soon as t > t0. For example, if a bar of metal has temperature 0 and another has temperature 100 and they are stuck together end to end, then very quickly the temperature at the point of connection is 50 and the graph of the temperature is smoothly running from 0 to 100.
The heat equation is used in probability and describes random walks. It is also applied in financial mathematics for this reason.
It is also important in Riemannian geometry and thus topology: it was adapted by Richard Hamilton when he defined the Ricci flow that was later used by Grigori Perelman to solve the topological Poincaré conjecture.
The heat equation is derived from Fourier's law and conservation of energy (Cannon 1984). By Fourier's law, the flow rate of heat energy through a surface is proportional to the negative temperature gradient across the surface,
where k is the thermal conductivity and u is the temperature. In one dimension, the gradient is an ordinary spatial derivative, and so Fourier's law is
where ux is du/dx. In the absence of work done, a change in internal energy per unit volume in the material, ΔQ, is proportional to the change in temperature, Δu. That is,
where cp is the specific heat capacity and ρ is the mass density of the material. (In this section only, Δ is the ordinary difference operator, not the Laplacian.) Choosing zero energy at absolute zero temperature, this can be rewritten as
The increase in internal energy in a small spatial region of the material
over the time period
is given by[1]
where the fundamental theorem of calculus was used. Additionally, with no work done and absent any heat sources or sinks, the change in internal energy in the interval [x-Δx, x+Δx] is accounted for entirely by the flux of heat across the boundaries. By Fourier's law, this is
again by the fundamental theorem of calculus.[2] By conservation of energy,
This is true for any rectangle [t−Δt, t+Δt] × [x−Δx, x+Δx]. Consequently, the integrand must vanish identically:
Which can be rewritten as:
or:
which is the heat equation. The coefficient k/(cpρ) is called thermal diffusivity and is often denoted α.
In the special case of heat propagation in an isotropic and homogeneous medium in a 3-dimensional space, this equation is
where:
The heat equation is a consequence of Fourier's law of cooling (see heat conduction).
If the medium is not the whole space, in order to solve the heat equation uniquely we also need to specify boundary conditions for u. To determine uniqueness of solutions in the whole space it is necessary to assume an exponential bound on the growth of solutions, this assumption is consistent with observed experiments.
Solutions of the heat equation are characterized by a gradual smoothing of the initial temperature distribution by the flow of heat from warmer to colder areas of an object. Generally, many different states and starting conditions will tend toward the same stable equilibrium. As a consequence, to reverse the solution and conclude something about earlier times or initial conditions from the present heat distribution is very inaccurate except over the shortest of time periods.
The heat equation is the prototypical example of a parabolic partial differential equation.
Using the Laplace operator, the heat equation can be simplified, and generalized to similar equations over spaces of arbitrary number of dimensions, as
where the Laplace operator, Δ or , the divergence of the gradient, is taken in the spatial variables.
The heat equation governs heat diffusion, as well as other diffusive processes, such as particle diffusion or the propagation of action potential in nerve cells. Although they are not diffusive in nature, some quantum mechanics problems are also governed by a mathematical analog of the heat equation (see below). It also can be used to model some phenomena arising in finance, like the Black-Scholes or the Ornstein-Uhlenbeck processes. The equation, and various non-linear analogues, has also been used in image analysis.
The heat equation is, technically, in violation of special relativity, because its solutions involve instantaneous propagation of a disturbance. The part of the disturbance outside the forward light cone can usually be safely neglected, but if it is necessary to develop a reasonable speed for the transmission of heat, a hyperbolic problem should be considered instead – like a partial differential equation involving a second-order time derivative.
The function u above represents temperature of a body. Alternatively, it is sometimes convenient to change units and represent u as the heat density of a medium. Since heat density is proportional to temperature in a homogeneous medium, the heat equation is still obeyed in the new units.
Suppose that a body obeys the heat equation and, in addition, generates its own heat per unit volume (e.g., in watts/L) at a rate given by a known function q varying in space and time.[3] Then the heat per unit volume u satisfies an equation
For example, a tungsten light bulb filament generates heat, so it would have a positive nonzero value for when turned on. While the light is turned off, the value of for the tungsten filament would be zero.
The following solution technique for the heat equation was proposed by Joseph Fourier in his treatise Théorie analytique de la chaleur, published in 1822. Let us consider the heat equation for one space variable. This could be used to model heat conduction in a rod. The equation is
where u = u(x, t) is a function of two variables x and t. Here
We assume the initial condition
where the function f is given and the boundary conditions
Let us attempt to find a solution of (1) which is not identically zero satisfying the boundary conditions (3) but with the following property: u is a product in which the dependence of u on x, t is separated, that is:
This solution technique is called separation of variables. Substituting u back into equation (1),
Since the right hand side depends only on x and the left hand side only on t, both sides are equal to some constant value − λ. Thus:
and
We will now show that solutions for (6) for values of λ ≤ 0 cannot occur:
This solves the heat equation in the special case that the dependence of u has the special form (4).
In general, the sum of solutions to (1) which satisfy the boundary conditions (3) also satisfies (1) and (3). We can show that the solution to (1), (2) and (3) is given by
where
The solution technique used above can be greatly extended to many other types of equations. The idea is that the operator uxx with the zero boundary conditions can be represented in terms of its eigenvectors. This leads naturally to one of the basic ideas of the spectral theory of linear self-adjoint operators.
Consider the linear operator Δ u = ux x. The infinite sequence of functions
for n ≥ 1 are eigenvectors of Δ. Indeed
Moreover, any eigenvector f of Δ with the boundary conditions f(0)=f(L)=0 is of the form en for some n ≥ 1. The functions en for n ≥ 1 form an orthonormal sequence with respect to a certain inner product on the space of real-valued functions on [0, L]. This means
Finally, the sequence {en}n ∈ N spans a dense linear subspace of L2(0, L). This shows that in effect we have diagonalized the operator Δ.
In general, the study of heat conduction is based on several principles. Heat flow is a form of energy flow, and as such it is meaningful to speak of the time rate of flow of heat into a region of space.
Thus the rate of heat flow into V is also given by the surface integral
where n(x) is the outward pointing normal vector at x.
By Green's theorem, the previous surface integral for heat flow into V can be transformed into the volume integral
Putting these equations together gives the general equation of heat flow:
Remarks.
A fundamental solution, also called a heat kernel, is a solution of the heat equation corresponding to the initial condition of an initial point source of heat at a known position. These can be used to find a general solution of the heat equation over certain domains; see, for instance, (Evans 1998) for an introductory treatment.
In one variable, the Green's function is a solution of the initial value problem
where δ is the Dirac delta function. The solution to this problem is the fundamental solution
One can obtain the general solution of the one variable heat equation with initial condition u(x,0) = g(x) for -∞<x<∞ and 0<t<∞ by applying a convolution:
In several spatial variables, the fundamental solution solves the analogous problem
in -∞<xi<∞, i=1,...,n, and 0<t<∞. The n-variable fundamental solution is the product of the fundamental solutions in each variable; i.e.,
The general solution of the heat equation on Rn is then obtained by a convolution, so that to solve the initial value problem with u(x,t=0)=g(x), one has
The general problem on a domain Ω in Rn is
with either Dirichlet or Neumann boundary data. A Green's function always exists, but unless the domain Ω can be readily decomposed into one-variable problems (see below), it may not be possible to write it down explicitly. The method of images provides one additional technique for obtaining Green's functions for non-trivial domains.
A variety of elementary Green's function solutions in one-dimension are recorded here. In some of these, the spatial domain is the entire real line (-∞,∞). In others, it is the semi-infinite interval (0,∞) with either Neumann or Dirichlet boundary conditions. One further variation is that some of these solve the inhomogeneous equation
where f is some given function of x and t.
Comment. This solution is the convolution in , that is with respect to both the variables and of the fundamental solution and the function both meant as defined on the whole and identically 0 for all . One verifies that which is expressed in the language of distributions as where the distribution is the Dirac's delta function, that is the evaluation at 0.
Comment. This solution is obtained from the preceding formula as applied to the data suitably extended to , so as to be an odd function of the variable that is, letting for all and Correspondingly, the solution of the inhomogeneous problem on is an odd function with respect to the variable for all values of and in particular it satisfies the homogeneous Dirichlet boundary conditions
Comment. This solution is obtained from the first formula as applied to the data suitably extended to , so as to be an even function of the variable that is, letting for all and Correspondingly, the solution of the inhomogeneous problem on is an even function with respect to the variable for all values of and in particular, being a smooth function, it satisfies the homogeneous Neumann boundary conditions
Since the heat equation is linear, solutions of other combinations of boundary conditions, inhomogeneous term, and initial conditions can be found by taking an appropriate linear combination of the above Green's function solutions.
For example, to solve
let
where u and v solve the problems
Similarly, to solve
let
where w, v, and r solve the problems
Solutions of the one-dimensional heat equation on a finite spatial interval (0,1) involve the Jacobi theta function, defined by[4]
where
is the fundamental solution.
By the method of images, the theta function gives Green's functions for the initial boundary value problems on the interval (0,1). For instance, the solution to the following problem
is given by
Solutions of the heat equations satisfy a mean-value property analogous to the mean-value properties of harmonic functions (solutions of ), though a bit more complicated. Precisely, if solves and then
where is a "heat-ball", that is a super-level set of the fundamental solution of the heat equation:
Notice that as so the above formula holds for any in the (open) set for large enough. Conversely, any function u satisfying the above mean-value property on an open domain of is a solution of the heat equation. This can be shown by an argument similar to the analogous one for harmonic functions.
One can model particle diffusion by an equation involving either:
In either case, one uses the heat equation
or
Both c and P are functions of position and time. D is the diffusion coefficient that controls the speed of the diffusive process, and is typically expressed in meters squared over second. If the diffusion coefficient D is not constant, but depends on the concentration c (or P in the second case), then one gets the nonlinear diffusion equation.
The random trajectory of a single particle subject to the particle diffusion equation (or heat equation) is a Brownian motion. If a particle is placed at at time , then the probability density function associated with the position vector of the particle will be the following:
which is a (multivariate) normal distribution evolving in time.
With a simple division, the Schrödinger equation for a single particle of mass m in the absence of any applied force field can be rewritten in the following way:
This equation is formally similar to the particle diffusion equation, which one obtains through the following transformation:
Applying this transformation to the expressions of the Green functions determined in the case of particle diffusion yields the Green functions of the Schrödinger equation, which in turn can be used to obtain the wavefunction at any time through an integral on the wavefunction at t=0:
Remark: this analogy between quantum mechanics and diffusion is a purely formal one. Physically, the evolution of the wavefunction satisfying Schrödinger's equation might have an origin other than diffusion.
The heat equation arises in the modeling of a number of phenomena and is often used in financial mathematics in the modeling of options. The famous Black–Scholes option pricing model's differential equation can be transformed into the heat equation allowing relatively easy solutions from a familiar body of mathematics. Many of the extensions to the simple option models do not have closed form solutions and thus must be solved numerically to obtain a modeled option price. The heat equation can be efficiently solved numerically using the Crank–Nicolson method of (Crank & Nicolson 1947). This method can be extended to many of the models with no closed form solution, see for instance (Wilmott, Howison & Dewynne 1995).
An abstract form of heat equation on manifolds provides a major approach to the Atiyah–Singer index theorem, and has led to much further work on heat equations in Riemannian geometry.